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Abstract 

This article concerns tests for the two-sample location problem when the dimension 
is larger than the sample size. The traditional multivariate-rank-based procedures 
cannot be used in high dimensional settings because the sample scatter matrix is not 
available. We propose a novel high-dimensional spatial rank test in this article. The 
asymptotic normality is established. We can allow the dimension being almost the 
exponential rate of the sample sizes. Simulations demonstrate that it is very robust 
and efficient in a wide range of distributions. 
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1 Introduction 


Nowadays, high-dimensional data have been generated in many areas, such as microarray 
analysis, hyperspectral imagery. The traditional statistical methods, which assume the di¬ 
mension is fixed, may not work in the high-dimensional settings. In the last decades, sta- 
tistican devoted many new methods to deal with high dimensional data. Speci ally, many 
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(2011) 
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(20151 for two-sample tests for means. 
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Chen. Zhane; and Zhone (20101 and Zou et ah (20141 


for testing a specifi c covariance structure, Goeman et ah ( 20061 ). Zhong and Chen ( 2011 1 and 


Feng et ah (1201311 for high-dimensional regression coefficients. 


In this paper, we consider the high dimensional two sample location problem. iBai and Saranadasa 


(1199611 proposed a test statistic by replacing the Mahalanobis norm in Hotelling’s test 
statistic with Euclidi an norm. To allow simultaneous testing for ultra-high di mensional data, 
Chen and Qinl ( 2010ll proposed a test statistic by removing the square term in Bai and Saranadasa 
(1l996l ). However, both these two test statistics are not invariant under scalar transforma¬ 
tion, JC —>■ BX where B is a diagonal matrix. Recently, many scalar-invariant test statistics 
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have been constructed, such as ISrivastava and Dul fl2008h. ISrivasta va. Katavama and Kano 

( 2013 ). Park and Avvala ( 2013 ). Feng et ah ( 2015 ). Gregory et al.l (12015 ). All these methods _ 

are b ased on the multivariate normal assumption or the diverging factor model (iBai and Saranadasal . 
19961) . These assumptions are a little restrictive for application. For example, the multi¬ 


variate t-distribution or mixture of multivariate normal distribution do not belong to them. 
Moreover, the performance of these moment based tests would be degraded for heavy-tailed 
distributions. 

In the traditional hxed dimension circumstance, multivariate sign or rank based methods 
are often used to construct robust tests. Those test statistics are very efficient and distribu¬ 
tion free under mild assumptions, or asymptotic so. However, those classic multivariate sign 
or rank based tests also cannot be directly used in high dimensional data because the scatter 
matrix is not available. Recently, Wang. Peng and Li ( 2ni5[) proposed a high-dimensional 
nonparametric multivariate test for one sample location problem based on spatial-signs. 


Paindaveine and VerdeboutI (120151) also proposed a hi gh-dimensional spatia l sign t est for the 


one sample problem by restricting to spherical cases. iFeng. Zou and Wangl (120151) proposed 


a scalar invariant test statistic based on spatial sign for two sample location problem. They 
show that their proposed test is robust and efficient for a wide range of distributions. How¬ 
ever, to estimate the location parameter, they only can allow the dimension being the square 
of the sample sizes at most. It is too restrictive for application. Thus, we need to propose a 
new robust test procedure to allow for ultra-high dimensional data. 

Multivariate rank based methods also perform very efficient in constructing robust test 
procedures. In this article, we propose a high-dimensional spatial rank test for two sam¬ 
ple location problem. First, we estimate the scale of each variable by spatial rank based 
procedures. Then, we propose our high dimensional spatial-rank test based on leave out 
method. The test statistic is scalar-invariant and treats all the variables in a “fair” way. 
Furthermore, unlike the spatial sign, we do not need to estimate the location parameters 
for spatial rank. Thus, there are no bias term in our test statistic when the dimension in¬ 
creases. So we can allow the dimension being almost exponential rate of the sample sizes. We 
also establ ish its asymptotic norm ality and propose the asymptotic relative efficiency with 
respect to IPark and Avvalal (120131) ’s test. Simulation studies show that our test performs 
better than other moment-based test procedures under heavy-tailed distributions. And when 
the dimension is ultra-high against the sample sizes, our test would be more powerful than 


Feng. Zou and Wangl (120151 ) ’s test because of its conservation. All the detailed proofs are 


given in the Supplementary Material. 


2 High-Dimensional Spatial-rank test 

2.1 The proposed test statistic 

Assume {Xu, • • • , for i = 1, 2 are two independently and identically distributed (i.i.d.) 

random samples from p-variate elliptical distribution with density functions 

det(Si)-^/2^(||S"^/^(x - ^Jll) 
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where are the symmetry centers and Sj is the positive dehnite symmetric p x p scatter 
matrix. Here we consider the equal scatter matrix case, i.e. I] = Si = S 2 . We wish to test 


Ho : /Xi = pb 2 versus Hi : pL^. 


( 1 ) 


Hotelling’s test statistic Hn = — X 2 )^S”^(Xi — X 2 ) is the classic method to 

deal with this two sample location problem, where n = ni +^ 2 , Xj are the sample mean and 
S„ is the pooled sample covariance matrtix. However, it is not very efficient for the heavy¬ 
tailed distributions. Multivariate sign or rank bas e d me thods are often used to construct 
robust test for these location testing problem fjOial. I 2 OIOI) . Dehne the spatial sign function 
t/(x) = x/||x||J(x ^ 0). When the dimension p is hxed, the spatial rank test using inner 
standardization is 


= np 


Y.]=ini\\Vi 






where Vi = Si=i ~ ^ Sfchi U{S — Xu.)) and the full rank transfor¬ 
mation matrix satisfy 


2 n,: 


n 


( 2 ) 


i=l j=l 


In traditional hxed p circumstance, is affine-invariant and very robust. When the dis¬ 
tribution is heavy-tailed, is more efficient than the classic Hotelling’s test. However, 
when the dimension p is larger than the sample size n, the scatter matrix is not available 
and then is not well dehned. Alternatively, we could estimate the diagonal matrix of S 
which is able to treat all the variables in a “fair” way. 

For the sample similar to ([2]), we suggest to hnd a diagonal matrix Vi satisfy 


diag 



oc Ip, 


where 


RiVp'^X.j) = 1 ^ U(vp'\x„ - X„)). 


k=l 


Thus, we adopt the following recursive algorithm 


Vi ^ Py^diag 



pV2 p - 

■ ’ ' 


■V.. 


The resulting estimators of diagonal matrix are denoted as Dj. We may use the sample 
variances as the initial estimators. Note that we hxed tr('Di) = p in this algorithm. Since 
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t/((jx) = t/(x), without loss of generality, we assume tr(S) = p in the following. So Dj is a 
consistent estimator of the diago nal matrix of S , he. D (Lemma ?? in the Appendix). 

A natural idea is mimicking IChen and QinI (120lOl) and considering the following test 
statistic 


Gn = 


E r 

i 


vlv 






+ 




ni{ni - 1 ) 722(^2 - 1 ) 


ni?7.2 


where = n-^ ^Li Eti )) is the sample “spatial-rank” of Xij in the 

total samples. And D is also the estimator of D based on the total samples. However, there 
are two drawb acks. First, there wou l d be a non-negligible bias term in Gn with the growth 
of dimension flFeng. Zou and Wand. 120151) . Second, those terms of — X^j)) 

are useless in detecting the difference between these two samples. Thus, based on leave out 
method and excluding R{ty~^R{Xii — Xij)) terms, we propose the following high dimensional 
spatial rank test statistic 


ni ni n2 n2 


= _ 

- 1) n2(n2 






S^t 


where = ^Di(jj) -|- ^D 2 (s,p, Dipj) and D 2 (s,p are the corresponding estimators of 

the diagonal matrix with leave-two-out samples {Xik}kj^i,j, {X 2 k}k^s,t, respectively. Ob¬ 
viously, Uli — X 2 s)) will be deviate from zero if Hi 7 ^ /X 2 and then we will 
reject the null hypothesis with large values of T„. As shown later, the expectation of T„ is 
asymptotic negligible compared to its s tandard deviation under Hq. We do not need a bias 
correction procedure ( Feng et ahl . 2015 ). Moreover, the value of remains unchanged for 
Xij = DXjj -|- c, where D = diagjdf, • • • , dp}, di are non-zero constants and c is a constant 
vector. Our proposed test statistic is invariant under location shift and the group of scalar 
transformations. 


2.2 Theoretical Results 

We need the following conditions for asymptotic analysis: as ? 7 ,,p —)■ 00, 
(Cl) rii/n —)■ K G (0,1); 

(C2) tr(R4) = o(tr2(R2)) where R = 

(C3) log(p) = o(n) and tr(R^) — p = o{n~^p^)-, 


Condition (C2) is the same as the condition (4) in iPark and Awalal (120131) . Condition (C3) 
is used to get the consistency of the diagonal matrix estimators. To reduce the the difference 
between Y)~^R{Xij — and £ij = Yi~^R{Xij — /j,-), we require the correlation between 
those variables to be not very strong and the dimension to be higher enough. 

First, we establish the asymptotic null distribution of T„. 
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Theorem 1 Under Conditions (C1)-(C3) and Hq, as {p,n) —)■ oo 


Tjan^N{0,l), 


where al = + 2 n,{nl-i)p^ + Tckc) 

In order to construct the test procedure, we still need to estimate tr(R^). Here we proposed 
the following three ratio-consistent estimator of tr(R^), s = 1,2, 


tr(R2)^ 


2p2 

pF 


-X 

/ ^ \ 5 ( 21 , 22 , 23 ,^ 4 )^ 




522 

T 


5(21,22,23,24) V 5^4, 


tr(R2) 


P 


2 221 221 222 222 


^ ___ 

^ ^ il^*2 *37^*4 


U{D~F^. . .UXsu-Xsu 

5(21,22,23,24) V 




where 05 ( 4 ,^ 42 , 13 ,* 4 ) the corresponding estimators of diagonal matrix with leave-four-out 

* 

samples {X 5 j}fc^ 4 ,^ 42 , 43,44 . Throughout this article, we use X] to denote summations over 
distinct indexes. For example, in tr(R2)^, the summation is over the set {ii 7 ^ *2 7^ *3 7^ * 4 }, 
for all ii, i 2 , * 3 , *4 ^ { 1 ; • ■ ‘ > ''^ 1 } and P™ = n\/{n — m)\. 

Proposition 1 Under Condition (C1)-(C3), as {p,n) —)■ 00 , we have 


5^)s g 4 
tr(R2) 


s = 1,2,3. 


Here we use the following ratio-consistent estimator for a^. 


= 


2ni{ni — l)p‘ 


dr(R2)^ -1 


2n2{n2 - 1) 


tr(R2)2 + 


nin2P 


;'tr(R2) 


3- 


This result suggests rejecting Hq with a level of signihcance if Tn/^n > where Za is the 
upper a quantile of iV(0,1). 

Next, we consider the asymptotic distribution of under the alternative hypothesis 

(C4) - M 2 ) = C>(co V,,), (Mi - M 2 )^D-i/ 2 rd-V 2 (^^ _ = o{npcF(Tn) 

where cq = E{\\D-P\X,, - X4a,)||-1). 

Condition (C4) require the difference between and P 2 is not large so that the variance of 
T 44 is still asymptotic cr^. And then we can propose the explicit power expression of our test. 


Theorem 2 Under Conditions (C1)-(C4), as {p,n) —)■ cxd, we have 


Tn - cg(pi 
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As a consequence, the asymptotic power of our proposed test (abbreviated as SR here¬ 
after) becomes 


/3sr(||Ri - M 2 II) = + 


2 cgpnK(l -- ^2)^0 HR1-R2) 

v/2tr(R2) 


In comparison, Park and Ayyalal ( 2013 ) show that the asymptotic power of their proposed 
test (abbreviated as PA hereafter) is 


/5pa(||Ri - M 2 II) = ® + 


npK{l - - ^^2) 0 1 (/Xi - /X2) 


E(||£,,-||2)y2R(^ 

The asymptotic relative efficiency (ARE) of SR with respect to PA is 

ARE(SR,PA) = 2clE{\\e,,\\^) ^2{E(||si - S2\\-^)yE{\\e\\^) 

={i^(||£i-S2||-')mi|ei-S2|P)>l. 


by the Cauchy inequality and Cq = -E(||£i — £211 ^)(1 + o(l)) under Condition (C4) (see the 
proof of Theorem 1). If ||£i — £ 2 |p/-E(||£i — £ 2 !^) A 1, SR is equivalent to PA. Otherwise, 
our SR test would be more efficient. 

The ARE values for multivariate t-distribution with i/ = 3,4, 5, 6 are 1.98, 1.48, 1.31, and 
1.22, respectively. Clearly, the SR test is more powerful than PA when the distributions are 
heavy-tailed {u is small), which is also verihed by simulation studies in Section 3. 


3 Simulation 

Here we report a simulation study designed to evaluate the performance of the proposed SR 
test. All the simulation results are based on 2,500 replications. We consider the following 
hve scenarios: 

(I) Multivariate normal distribution. Xij ~ 

(II) Multivariate normal distribution with different component variances. X^j ~ 

where S = and D = diagjd^, • • • , d^}, d'j = 3, j < p/2 and d'j = 1, j > p/2. 

(III) Multivariate t-distribution ^^ 3 . Xj^’s are generated from with S = R. 

(IV) Multivariate t-distribution with different component variances. Xj^’s are generated 
from and d^’s are generated from xl- 

(V) Multivariate mixture normal distribution MNp^..y^ 9 . Xj^’s are generated from R)-l- 

(1 — x)fp{fj,i, 9R), denoted by MNp^..y^ 9 , where /p(-; •) is the density function of p-variate 
multivariate normal distribution. 7 is chosen to be 0 . 8 . 
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Table 1: Empirical Size and power comparison at 5% significance when p < n. 


{npp) 

Size 

(30,24) (40,32) 

Dense Case 
(30,24) (40,32) 

Sparse Case 
(30,24) (40,32) 

Scenario 

TR 

SR 

TR 

SR 

TR 

SR 

TR 

SR 

TR 

SR 

TR 

SR 

(I) 

1.6 

6.3 

1.3 

5.4 

27 

63 

41 

96 

42 

58 

63 

96 

(11) 

0.7 

6.5 

1.8 

5.2 

27 

64 

43 

98 

42 

58 

69 

96 

(HI) 

1.4 

6.4 

0.8 

5.7 

19 

50 

31 

82 

29 

45 

46 

81 

(IV) 

1.2 

4.9 

1.1 

5.3 

51 

60 

72 

91 

29 

54 

54 

59 

(V) 

1.2 

5.3 

0.9 

4.5 

18 

45 

25 

78 

24 

44 

43 

76 


First, we consider the low dimensional case p < n and compare the SR test with the 
traditional spatial-rank-based test (abbreviated as TR). The common correlation ma¬ 
trix is R = For power comparison, we consider the same confignrations of Hp. 

p =: ||D-V2(^^ ~R 2 )IP/■\/tr(R^) = 0.5. Withont loss of generality, nnder Hi, we fix = 0 
and choose ^^2 follows. The percentage of pn = /i 2 i for / = 1, • • • ,p are chosen to be 95% 
(Sparse Case) and 50% (Dense Case), respectively. At each percentage level, all the nonzero 
P 2 i are eqnal. Two combinations of {ni,p) are considered: (30,24) and (40,32). Tabled] 
reports the empirical sizes and power of these two tests. The empirical sizes of TR is signif¬ 
icantly smaller than the nominal level. However, onr SR test can control the empirical sizes 
in most cases. In addition, onr SR test i s more powerfnl than the T R test in all cases. This 
findings are consistent with the resnlts in iBai and Saranadasal (jl99fil ). Classical Mahalanobis 
distance may lose efficiency because of the contamination bias in estimating the covariance 
matrix with large p. When p/n ^ c E (0,1), having the inverse of the estimate of the scatter 
matrix in constructing tests would be no longer beneficial. 

Next, we consider the high-dim ensional cases, p > n, and comp are the SR with the tests 
pro posed by IChen and Qid fl2010ll (abbreviated as CQ hereafter), jPark and Avvalal (1201311 
and Feng. Zou and Wang ( 20151 ) (abbreviated as SS hereafter). The sample size is chosen 
as rii = n 2 = 20. Four dimensions p = 100,200,400,800 are considered. The other settings 
are all the same as low dimensional cases. Table [2] reports the empirical sizes and power of 
these four tests under normal and non-normal cases. The sizes of CQ, PA and SR tests are 
generally close to the nominal level under all the scenarios. In contrast, the sizes of the SS 
test are a little smaller than 5%, i.e., too conservative when pjr? is large. It is not strange 
because SS test can only allow the d imension p being the square of the sample size n. As 
shown in iFeng. Zou and Wang] (120151) . when pjr? is larger, there would be a non-negligible 
bias term in SS test statistic because of the estimation of location parameters. However, 
our SR test can allow the dimension being almost the exponential rate of the sample sizes. 
Consequently, our SR will be more powerful than the SS test when p/ti? is large. Moreover, 
under the normal cases (Scenarios (I) and (H)), SS and SR tests perform similar to PA test. 
However, under the non-normal cases (Scenarios (HI)-(V)), both SS and SR tests are clearly 
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more efficient than PA and CQ tests. It is consistent with the theoretical resnlts in Section 
2. Our SR test is more robust and efficient when the distribution is heavy-tailed. Finally, 
SS, PA and SR tests are more powerful than CQ test when the variance of each variables are 
not equal (Scenario (II) and (IV)), which demonstrates that a scalar-invariant test is needed. 

Next, we also consider the unequal scatter matrix case, i.e. Si 7 ^ S 2 . Now, we consider 
Ri = (0.5l*“-^l) and R 2 = Ip in this study. The other settings are all the same as the above 
equal scatter matrix cases except that rj =: —^ 2 )!^/■\/tr(Rf) = 0.5. Here we only 

consider two combinations of {ni,p): (20, 200), (20, 800). We report the simulation results in 
Table [3l Our SR test can also control the empirical sizes under this unequal scatter matrix 
assumption. The other results are all similar to Table [2j SS is still a little conservative. Our 
SR still performs better than the other tests in most cases. It shows that our SR test can 
also be used in the unequal scatter matrix cases. 

Finally, to study the effect of correlation matrix on the proposed test and to further 
discuss the application scope of our method, we explore another four scenarios with different 
correlations and distributions. The following moving average model is used: 

^ijk = llPill ^{PilZij -|- Pi2-Z’i(j+1) + • • • + PiTi-^iO'+Ti-l)) + Pij 

for i = 1,2, j = 1, ■■■ ,ni and k = 1 , • • • ,p where = (p^i,... ,PiTi)^ and {Zijk} are i.i.d. 
random variables. Consider four scenarios for the innovation {Zijk}: 

(VI) All the {Zyfcj’s are from N{0, 1); 

(VII) the hrst p/2 components of are from centralized Gamma( 8 ,l), and the others 

are from N{0, 1). 

(VIII) All the {Zjjfc}’s are from G; 

(IX) All the {Zjjfcl’s are from 0.8V(0,1) -|- 0.2A^(0, 9). 

The coefficients {pii}Jii are generated independently from U{2,3) and are kept hxed once 
generated through our simulations. The correlations among Xijj. and Xiji are determined by 
\k — l\ and Ti. We consider the “full dependence” for the hrst sample and the “2-dependence” 
for the second sample, i.e. Ti = p and T 2 = 3, to generate different covariances of Xij. For 
simplicity, set p =: |— P 2 \\'^/ A/tr(A^) -|- tr(A|) = 0.1 where Aj is the covariance matrix of 
Xij and {ni,p) = (20, 200), (20, 800). Table 0] reports the simulation results under these four 
non-elliptical distributions. Our SR test also performs well in these cases. The empirical 
sizes of SR are close to the nominal level. The power of SR test is still a little larger than 
PA and SS in most cases. It also shows the robustness of our SR test. 

All the above simulation results show that our SR test is very robust and efficient test 
procedure in a wide range of distributions. SR performs better than the other tests based 
on the direct observations when the distribution is heavy-tailed. In addition, when the 
dimension is larger than the square of sample sizes, our SR test is more efficient than SS test 
because of the conservation of SS test in this case. 
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4 Discussion 


In this article, we propose a new test for high dimensional two sample location problem 
based on spatial rank. Compared with the other L 2 -norm-based tests, our proposed test is 
very ro bust and efficient, especi ally for heavy-tailed or skewed distributions. In another di- 
rection, Cai. Lin and Xia ( 20141 ) proposed a test based on max-norm of marginal t-statistics. 
Zhong. Chen and Xu ( 20131) also proposed a L 2 -thresholding statistic. Both these two tests 


can detect more sparse and stronger signals whereas the L 2 -norm-based tests is for denser 
but fainter signals. Developing a spatial-rank-based test for sparse signals is very interest 
and deserves further study. 

In the case of elliptical distributions. 111 allin and Paindaveine f 20021) propose a class of 
tests based on interdirections and pseudo-Mahalanobis ranks when the dimension is fixed, 
which are distribution-free, affine-invariant, and achieve semiparametric efficiency at given 
reference densities. The Hallin-Paindaveine signs and ranks have been successful in many 
problems involving elliptical densities (one and two-sample location; scatter; homogeneity of 
scatter; regression; VARMA dependence; principal components, etc.). How to construct a 
test base on Hallin-Paindaveine signs and ranks for high dimensional data deserves further 
studies. 
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Table 2: Empirical Size and power comparison at 5% significance with equal scatter matrix 

Size Dense Case Sparse Case 



CQ 

SS 

PA 

SR 

CQ 

SS 

PA 

SR 

CQ 

SS 

PA 

SR 






Scenario (I) 







(20,100) 

6.9 

4.8 

5.0 

6.3 

83 

77 

77 

82 

79 

73 

74 

79 

(20,200) 

5.3 

3.4 

3.3 

5.0 

87 

81 

83 

86 

86 

81 

82 

87 

(20,400) 

5.5 

2.5 

4.1 

4.7 

90 

80 

85 

90 

89 

80 

85 

90 

(20,800) 

5.4 

1.2 

4.0 

4.5 

92 

81 

89 

92 

91 

79 

88 

91 






Scenario (II) 







(20,100) 

7.5 

5.0 

4.3 

6.0 

44 

78 

78 

82 

40 

73 

73 

78 

(20,200) 

4.5 

3.1 

2.5 

4.4 

47 

81 

82 

87 

44 

81 

82 

87 

(20,400) 

6.0 

2.5 

4.1 

5.7 

46 

81 

86 

91 

42 

80 

85 

90 

(20,800) 

6.6 

1.2 

3.8 

5.6 

42 

80 

85 

90 

45 

79 

88 

91 






Scenario (III) 






(20,100) 

5.8 

4.3 

3.3 

6.2 

42 

64 

29 

62 

37 

59 

29 

58 

(20,200) 

4.2 

3.0 

1.5 

4.7 

44 

67 

34 

69 

41 

63 

27 

63 

(20,400) 

6.1 

2.0 

3.9 

5.7 

44 

63 

32 

67 

41 

63 

31 

66 

(20,800) 

4.4 

0.7 

3.7 

5.3 

44 

61 

32 

72 

44 

60 

32 

71 






Scenario (IV) 






(20,100) 

5.8 

4.3 

3.3 

6.3 

11 

68 

33 

67 

70 

70 

33 

69 

(20,200) 

7.5 

3.0 

1.7 

4.4 

85 

73 

33 

73 

13 

65 

28 

65 

(20,400) 

6.4 

2.0 

3.9 

5.7 

10 

64 

30 

69 

90 

62 

29 

66 

(20,800) 

5.7 

0.7 

3.7 

5.2 

9.6 

60 

32 

73 

8.6 

62 

32 

72 






Scenario (IV) 






(20,100) 

7.5 

4.0 

5.5 

5.2 

42 

59 

35 

60 

35 

55 

30 

57 

(20,200) 

6.5 

3.0 

4.5 

5.2 

43 

60 

32 

63 

41 

56 

32 

61 

(20,400) 

6.1 

2.2 

3.7 

5.3 

41 

57 

30 

64 

39 

57 

28 

64 

(20,800) 

5.5 

0.8 

4.0 

5.6 

41 

54 

30 

68 

39 

52 

28 

66 
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Table 3: Empirical Size and power comparison at 5% significance with unequal scatter matrix 




Size 


Dense Case 


Sparse Case 

Scenario 

CQ 

SS 

PA 

SR 

CQ 

SS 

PA 

SR 

CQ 

SS 

PA 

SR 





(«i, 

P) = 

(20,200) 






(I) 

5.8 

2.8 

4.2 

6.5 

95 

89 

93 

95 

92 

87 

88 

93 

(11) 

6.5 

2.5 

3.4 

5.5 

58 

91 

93 

95 

52 

87 

88 

93 

(III) 

4.4 

2.2 

2.5 

3.9 

55 

79 

43 

77 

49 

74 

36 

72 

(IV) 

6.6 

2.1 

2.6 

4.6 

65 

80 

45 

79 

11 

75 

40 

74 

(V) 

6.1 

2.3 

5.2 

5.1 

53 

73 

39 

74 

49 

72 

38 

75 





(«i, 

P) = 

(20,800) 






(I) 

7.2 

1.1 

5.6 

5.9 

98 

91 

97 

98 

98 

90 

98 

99 

(11) 

6.9 

1.5 

5.2 

5.8 

64 

91 

97 

98 

59 

90 

98 

99 

(III) 

7.3 

1.3 

4.6 

4.6 

56 

73 

36 

85 

52 

68 

41 

80 

(IV) 

6.2 

1.2 

4.1 

4.1 

70 

70 

40 

87 

13 

58 

36 

74 

(V) 

4.4 

1.5 

4.6 

4.8 

52 

60 

35 

81 

51 

60 

39 

77 


Table 4: Empirical Size and power comparison at 5% significance with MV model 




Size 


Dense Case 


Sparse Case 

Scenario 

CQ 

SS 

PA 

SR 

CQ 

SS 

PA 

SR 

CQ 

SS 

PA 

SR 





(^i, 

P) = 

(20,200) 






(VI) 

6.7 

5.8 

6.4 

5.9 

32 

28 

26 

31 

43 

42 

36 

50 

(VII) 

3.9 

4.1 

6.1 

4.8 

23 

32 

31 

33 

38 

72 

69 

76 

(VIII) 

6.3 

3.7 

7.1 

5.1 

41 

34 

35 

41 

42 

44 

34 

53 

(IX) 

6.5 

5.4 

6.3 

5.7 

29 

27 

26 

33 

34 

36 

28 

47 





(«i, 

P) = 

(20,800) 






(VI) 

6.6 

4.8 

6.7 

5.8 

32 

27 

26 

28 

33 

35 

22 

40 

(VII) 

4.1 

4.0 

6.3 

4.7 

37 

43 

38 

46 

38 

72 

68 

74 

(VIII) 

5.9 

3.9 

7.8 

5.4 

39 

32 

31 

42 

41 

29 

31 

49 

(IX) 

6.4 

3.4 

6.2 

5.1 

34 

32 

28 

34 

31 

31 

23 

35 
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1 Proofs of Theorems 


1.1 Proof of Theorem 1 

Define ui* = E{U{£ii — e 2 j)\eii) and U 2 j = E{U{eii - £ 2 j)\e 2 j). Obviously, Ui*, U 2 j have 
the same distribution. And £'(uijU^j) = where the constant tf depend on the 

background distribution E. Dehne 


Y,, =D-V2(X,,- - /xj, V2^ = -E{U{Y^, - Y2^)\Y2^), 

Vu =E{UiYu - Y2,)\Yu),Pis = UiYu - 1^2.) - Vu + F 2 ., 
W,, =U{Yu - Y,,) - Vu - Pn, 

A =E{VuVl) = E{V2,VI). 


Let Dj = diag{(iji, • • • , dip} be th e diagonal matrix of Sj and D* = diagjdji, • • • , dip}. 

First, we restate Lemma 4 in Zou et all ( 2014 ) and propose some useful Lemmas. The 
proof of these Lemmas are given in Appendix B. 


Lemma 1 Suppose u are independent identically distributed uniform on the unit p sphere. 
For any p x p symmetric matrix M, we have 


E(u^Mu) 2 ={tr2(M) + 2tr(M2)}/(p2 + 2p), 

£'(u^Mu)^ ={3tr^(M^) + 6tr(M^)}/{p(p + 2){p + 4)(p + 6)}. 


Lemma 2 Under Condition (Cf), we have maxi<j<p((ijj — dij) = Op{n^ ^^‘^{XogpYF) 


Lemma 3 Ti? —)■ 0.5 as p ^ 00 . 
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Lemma 4 Suppose the eonditions given in Theorem 1 all hold, we have T„ = + Op(cT„), 

where 


ni{ni - 1) 

2 


-l ni ni . n2 n2 


ni n2 






nin2 ^ , 

1=1 j=i 


2i- 


Lemma 5 Suppose the conditions given in Theorem 1 all hold. 


Then, Z^jan 



iV(0,l). 


Proof of Theorem T. According to Lemma H] and [5l we can easily obtain the result. 


□ 


1.2 Proof of Proposition 1 

Taking the same procedure as Theorem 1, we can obtain 

ti^), ^ U{Yu, - Yu,fU{Yu, - Yu,)U{Yu, - Yu,YU{Yu, - Yu,) + 0p(tr(R2)) 

m 


2p2 

ri' 

2p 


Efl'u. - - Vu,){Vu, - Vi.,)'^(Vu, - Vu.) + Op(tr(R")) 


ni 

2 


E(yi«y>»t - w E yi.y^nVu^y 


PI 


PI 


1*3 


+ I^E + ».(“'(R")) 

ni 

=Ji + J 2 P J 3 P Op(tr(R^)). 

Obviously, E{Ji) = 4 pHt{A^). And 

(ufi^Ruii2, 


tT{A^) = E{{ViVu,)^} =E 


_{l + ufi(R-Ip)uij}2_ 

=E {(ufi^RuiiJ^} {1 + Op(l)} 

=r|p“^tr(R^){l + 0 ( 1 )} = 4“V~^ti'(R^){l + o(l)}, 

because A'[{u^j(R — Ip)uij}^] = r|.p“^(tr(R^) — Ip) = o(l) by Condition (C3). So E{Ji) = 
tr(R^){l + 0 ( 1 )}. Taking the same procedure as above, we have 


var 




ni(ni - 1) 


Pj 


= 0 {nfY)E{{Vl,V,,)^) P 0 {nfY)[E{{VlYi,)^}r 
=0(nh^tr(R‘^) +nh^tr^(R^)) = o(tr^(R^)). 


2 












by Lemma [T] and Condition (C2). Similarly, we can show that 

E{Jl) =0{n-,Y)E{{VlAVuf) + 0{n-,YM^^) = 

E{Jl) =0(nrytr(A^)) = (R^)). 

Thus, tr(R2)^ = tr(R^)(l + Op(l)). We can also show the ratio-consistency of the other 
estimators. □ 


1.3 Proof of Theorem 2 

Dehne Uis = U(Yu — Y 2 s) and r^g = 111^14 — 1^2511. Firstly, taking the same procedure as 
Lemma lU we can show that 


ni ni 712 R2 


T 

n 


ni{ni - I)n2(n2 - 1) 

1 1 

ni(ui - 1) n2(n2 - 1) 


zwE YY. -X„)) + 


i^j s^t 
ni ni 712 R2 


Y YY 


i^j s^t 


X U{Yij -Y 2 t + 'D fl^)) + OpicTnj 

^ Til Til 7l2 712 


ni{ni - I)n2(n2 - 1) 

2 1 


E EE E'^('"“ - Y2.fu{Yy - y^,) 


i^j s^t 
Til Til 712 rL2 


+ 


+ 


X 


ni(ui - I)n2(n2 - 1) 

1 1 

ni(ni - 1) 712 (n2 - 1) 


ruE EE E'-i'C, [I, - u,.ui] d-‘/2(4., - 4.,) 


ni ni 7i2 712 


H Yin Y^is^^jt\^i - /^2)^d [Ip - uYl] 


[Ip - UisUl] - M2) + Op{an). 


According to the same arguments as Theorem 1, we have 


Til ni 712 ^2 


77 i( 77 i - 1 ) 772(772 - 1 ) 


ruE EE Ev.‘C, [Ip - U,.ul] - ^ 2 ) 




Or ^ Or ^ 

° - Y - - Y + OpY), 

n2 ^ 

Z=1 


ni ^ 
2=1 


ni ni 712 ^2 


- ^ 2)^0 1/2 _ u.tUjt] 


77i(77i - 1) 772(772 - !)• 

X l^lp U j 

=Co(Mi - M2)^ D-^(/Xi - Y + Op{an). 


i^j s^t 

X [Ip - U 2 .UI] D-V 2 (p, - ^ 2 ,) 

. 2 /., \Tt\-1i 
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where Cq = E{r^^ ). Thus, 


T —7 

^ n — 


I: - M,) - - m2) 


rii ^ n2 

1=1 

+ Co(Ati - - ^ 2 ) + Op{an). 


And 


^ l/2^^1 _ j 1/2 j^J3 1/2('^^ _ 


ni 


rii 

2ci 


Z=1 

7^2 


E — ^ F^.D - ^2) = 0 ( 7 ^ V - ^2)^0 ^/^RD - /X2)). 


7^2 ^ -, 
2=1 


Under Condition (C4), the second and third parts of T„ are all Op(cT„). Then, by Theorem 
1, we have 


Tn - - fi2V'D 1(^1 - ^2) 


(Tri 


iV(0,l). 


Here we complete the proof. 


□ 


2 Proof of Lemmas 


2.1 Proof of Lemma [2] 

Proof. 

-| Rl -| ril 

RiB-^/^Xu) =- V U{Yu - Y,,) = - V(Pi, - V,, + W,j) 

Til rii 

i=i i=i 

1 1 

=Pn--VP2, + - 

ni rii 

^ i=i ^ i=i 

Obviously, = Op(^n-Tr(A2)) and ^ = Op{^fr7Hx(J7)). Thus, 

=U(VHPa (1 + 0(1)) 

=E{E{U{^^l\eu - Si,))f/(R+2(si, - si,))'^|R+2si,))(l + o(l)) 
R^+uijufjR^+ 


=E 


1 + uf.(R - Ip)uH 


(1 + 0 ( 1 )). 
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Thus, by the Cauchy inequality and Tyler expansion, 

E (diag {E } - TFp-%) 

< C4{E{uJ-{K - Ip)uij)2E(diag{R^/^uiiuf.R^/2} - TFP~%yy^^ 

= O ^p“^A/tr(R2) _ ^ o(n“^/^). 

by Condition (C3). The above equation define the functional equation for each component 
of di ‘ ‘ 7 dip^^ 

Tij{E,dij) = Op(n“^/^), (1) 

where Ei is the distribution function of Xij, i = 1,2. Similar to ?, the linearisation of this 
equation produces 

\/{dij \/^i( Tij ( Ejii , dij'j Tiji^Ei, dij^^ T Op(l), 

where Eni is the empirical distribution function of Xij,j = 1, • • • , Uj, Eli j is the corresponding 
Hessian matrix of the functional dehned in ([T]), and 


rii 

L 

i=i 


R{Eni, d,) = I vecl diag V R{-D-E^Xu)R{B~^^^Xuf - tfP 



where Ti^E^i, di) = (Tji(F„j, dn), • • • , Rp^E^i, dip)) and nec(B) means the vector of the diag¬ 
onal matrix of B. For each variance estimator dij, we have 

■s/ni{dij — dij) A iV(0, (fj), 

where (fj is the corresponding asymptotic variance. Dehne (max = niaxi<i<2,i<j<p Cp. 


P ( max {dij - dij) > ^(maxUi ^'^^(logp)^/q 
\l<j<P / 

V 

< y] P (yni{dij - dij) > \/2Cmax(logp)^/^) 
i=i 

= 5^ (l - ^{^CmaxCT~j\logp)^/^)'^ < P (l - $((2 logp) 

i=l 

< , ^ = {47r)-E\\ogp)-E^ ^ 0. 

V dvr log p 

Finally, maxi<j<p{dij - dij) = Op(n”^^^(logp)^/^). □ 
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2.2 Proof of Lemma [3] 

Proof. 

E{£^ei) =E{{ei - SjY'iei - Sk)) 

=E{E{{ei - £jY{ei - £k)\ei)) 

=E{E{\\£i - SjWWsi - ek\\U{£i - £j)'^U{£i - ek)\ei)) 
=E{{E{\\£i - £j\\\£,)f)E{E{U{£i - £jfU{£, - £k)\£i)) 
=E{{E{\\£i - £j\\\£,)y)E{uJu,) = TfE{{E{\\£, - S,|||s,))'). 
In addition, ii^(||£j|p) = Q.bE{\\£i — £j\\^). Thus, we only need to show that 


E{m\e^ 


h)y 


Em 


1. 


Because £i has the elliptical distribution, e* — £j also has the elliptical distribution. Dehne 
the density function of \ \£i — Sj\ \ is fit) = Cpt'P~^g{t) where Cp = yIp/I) • Thus, 


E((E(||£i - £j|||£i))2) _(/ CptPg{t)dty 


EiWei-m^) 


f CptP+^ g{f)dt 


-'p+i 


r^((p + i)/ 2 ) 


By the Stirling’s formula. 


lim 


cpCp+2 r(p/ 2 )r((p + 2 )/ 2 )' 

r(a; + 1) 


= 1 , 


as p —)■ oo, we have 
^2 


'-p+i 


-)■ 


(p - 1)P-^ 


CpCp+2 pP/2(p - 2 )(p-2)/2 

Here we complete the proof. 


x^oo [x/e)^{2'Kx)^i‘^ 

= (1 - p-^)^/'(l + (p - 2 )-^)(p-2)/2 ^ 1. 


□ 


2.3 Proof of Lemma [4] 

Proof. By the Taylor’s expansion, we have 

f/(D-;',%(X,i - - X,,)) 

=U(0-^''\Xu - X2,)Yu(T>-'l^(Xij - X2,)) 

+ f/(D-‘'"(Xi, - X2,)f(Ip - l/(D-‘'^(Xu - X2,))f/(D-‘/='(Xii - X2,)f) 

X (DfebtjD’''' - - X2,)) 

+ U(D-‘''-‘(Xu - X2,)f(Ip - l7(D-^^^(Xi2 - X2,))U(D-^'^(X22 - Xj,))’’) 
X - Ip)£/(D-‘'^(Xi, - X2,)) 

-j- Op(^(Jn)‘ 
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Next, we only show that 


ni ni 712 712 


ni{ni - 1 ) n2{n2 - 1 ) 


E EE _ i^) 

i¥=j 

X U{B-^/\Xu-X2s))=Op{an). 


Similarly, by f/(D ^/“^{Xu - X 2 s)) = Vii- V 2 s + Pis, we have 


ni ni 712 712 


ni{ni - 1 ) n2{n2 - 1 ) 


E EE - If 


i^j s^t 

xU{T>-^P{Xu-X2s)) 


1 


ni ni 


+ 


ni{ni - 1) 

1 




712 712 


n2{n2 - 1) 


ni n2 


Z-nE 




nin2 




i=i j=i 


ni 712 


+ 


+ 


+ 


nin2{ni - 1) 

2 


Z-nEE^^(D,;‘ED'''-i»)Wi 


S=1 
712 Til 


n2ni(n2 - 1) 




S^t 2=1 


721 Til 712 712 


ni(ni - 1) n2{n2 - 1) 


EEEE^^(D(tM.)D'''-ip)^7. 


i^j s^t 

= Qi + Q2 + Qs + Qi + Q5 + Qe- 

Here we only proof E{Q\) = o(ct^). The other parts Q 2 , Q 3 are similar to Qi. And the last 
three past are similar to the following proof of Ti. Taking the same arguments as Lemma [H 


EiQl) 


ni{ni - 1) 

1 

ni(ni - 1) 

1 

ni(ni - 1) 




E (1 + 0 ( 1 )) 


tr(A2)(log(p)/n)(l + 0 ( 1 )) = o{al). 
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Thus, under Hq, we have 


ni ni 712 ^2 


T 

-L n 


ni(ni - 1 ) n 2 (n 2 - 1 ) 


ni ni 


p^sf{y^, + V2t + p,t) + op(an) 




n 2 n 2 


ni(ni - 1 ) 
2 




ij 


ni 712 


n2{n2 - 1 ) 




2 i 


ni n 2 


nin 2 


E E + 


i=l j=l 




+ 


n2ni{n2 - 1 ) 


rTTEE^ 2 '"- + 


ni ni n 2 712 


s^t i=l 

= Zn + Ti + T2 + T3 + Op(an)- 


ni{ni - 1 ) n 2 (n 2 - 1 ) 


z-nEEEE^i^i>+“f(‘"”) 


i^j s^t 


Next, we only show that Ti = Op{an)- T 2 and T 3 are similar to Ti. 


E{T^) =E 


(- 


\ nin 2 (ni - 1 ) 




ij 


4ni(ni - I)n2(n2 - 1) 


i^j S = 1 

E(pyv,,pi:v„) 


n\n\{ni — 1 )^ 

4ni(ni - I)n2(n2 - 1) dT 


E{P^,V,,PlV,,) 


nfn|(ni — 1 )^ 

4 ni(ni - l)(ni - 2)n2 dT 

n^nKni — 1 )^ 

4ni(ni-l)(ni-2)n2 n^ pT^ ^ 

njnl{n,-iy ^(PisVi.PksVu) 

* -E(PZV,,VlP„) + 


-E{Pl.V,,Pj.V,>) 


n,n2(n,-1) ' “ “ nin2(ni - 1) 

-E(PlVi,VlP,.) + - d - 7TE(P;VijPj:Vu), 


nin2{ni - 1 )‘ 


nin2{ni - 1 )‘ 



because 


= EiP^^APu) 

=E{iUiYu - Y2 s) - Vu - V2sfAiUiYu - Y2t) - Vu - V2t)) 

=E{U{Yu - Y2sfAU{Yu - Y2t)) - E{U{Yu - Y2sfAVu) 

- E{U{Yu - Y2tfAVu) + E{VlAVu) 

=E{E{UiYu - Y2 sTAU{Yu - Y2t)\Yu)) - E{E{UiYu - Y2sfAVu)\Yu) 

- E{E{UiYu - Y2tfAVi,)\Yu) + tr(A2) 

=E{VlAV^i) - E{VlAVu) - E{VlAVu) + tr(A2) 

= 0 , 

E{PlV^,PlV,,) = E{PlAP,,) = 0, 

= ii{E{yuPls))\ 

E{P{Y^i)Pl) = EiVuiUiYu - Y2 s) - Vh - V 2 .)) 

= E{VuU{Yu - Y2sf) - E{VuVl) 

= E{E{VuU{Yu - Y2sf\Yu)) - E{VuVl) 

= E{VuVl) - E{VuVl) = 0. 


Next, we will show that E{P^VijVijPis) = E{P^APis) = 0(tr(A^)). In fact, we only 
need to show that E{U{Yii — Y 2 j)'^AU{Yii — Y 2 /)) = 0(tr(A^)). 

U{Yu-Y2,fAU{Yu - Y2 j) 


li 


\\Yu-Y2j\\ 
Yu - Yo 


AUiYu-Y2j) 
AU{Yu 


Yu-Yo + Yo-Y2j 


AU{Yu - Y 2 j) 


Y2j) 

Yo-Y 


2i 


\\Yu-Y2j\\' 
=U{Yu-YofAU{Yu-Y2,)-, 


+ U{Yo-Y2,yAU{Yu-Y2,) 


\\Yu-Y2j\\ 

|yn-yoll 

^n-^2,|| 


AU{Yu - Y 2 j) 
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Additionally, 


e(u{Yu-Y^)AU{Yu-Y2,)- 


Yu - Y, 




li 


2jl 


= E ( U{eu - eoYB}/‘^KR^/‘^U{eu - £ 2 ,- 


= E (^17(£i, - e^YY{}/‘^KR}/‘^U{eu - 

= 0(tr^(A^)). 


|£ii — £o| 
|£li — £2j| 

kli ~ £o| 


(\ 

£ij ~ ^0 

III 

£li — £2j 


I|£li ~ £2j| 

( 1 + 0 ( 1 )) 


(1 + 0 ( 1 )) 

(1 + 0 ( 1 )) 


Similarly, we can show another part is also 0(tr(A^)). Thus, E{U{Yii — Y 2 jY'AU{Yii — 
Y 2 j)) = 0(tr(A^)). Then, we obtain that + Op(cT„). □ 


2.4 Proof of Lemma [5] 

Proof. Let Ui = Vu for z = 1,..., rii and Uj+ni = V2j for j = 1,..., n2 and for i 7^ j, 

r nh^(rii - l)-^U^Uj, i,j e {1,2,...,ni}, 

(t>ij = I z e {1,2,... ,ni}, j G {rzi + 1,... ,zz}, 

[ n^^(zz2 - ly^UjUj, z, j e {rzi + 1, rzi + 2,..., n}. 

Dehne Z^j (pij for j 2 , 3 ,..., zz, Snm ^n-j nnd Enm ^^{C/i, U21 • • • 1 

which is the cr-algebra generated by {Ui, U2, ■ ■ ■, Um}- Now 

n 

■^n = 2 Znj- 
1=2 


We can verify that for each n, {Snm, Enm}m=i is the sequence of zero mean and a square 
integrable martingale. In order to prove the normality of Z^, according to Hall and Heyde 
(1980), it suffices to show the following two results: 


^ 4’ 

n 

1=2 


( 2 ) 

( 3 ) 
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First, we proof result ([2]). Note that 


'i-i 


E[ZUr„,-,] = { (5; UJU, ) \F, 


n,j-l 


< 2=1 
J-1 


n'jinj - 1) 

1 

h‘^j{hj - 1)2 
1 

h'^Ahj - 1)2 


iE{ UlUjUjU,,\F, 


n,j-l 


V 21,22 = 1 


2 Uj^E{UiVj\T„,_,)U. 

i: uiAu. 
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ni 


*1,*2 = 1 


where Uj = rii, for j G [l,Ui] and Uj = ^ 2 , for j G [ui,n]. Dehne rj^ = Yl ^=2 ^[^nj\^n,j-i] 
By some tedious algebra, we can obtain that Eijin) = iO'n(l + o(l)). 


Now write E{r]^) as 


E(,i) =E \ Y, 


i-1 


j=2 1 


E UlAU,, 


^^e{ ± 

U<ii< 

+^Ie 


(hj - 1)2 

1 1 

- ^y^%inj2 - 1)^ 

1 


E E uIau^MAu 


*4 


J2' 

1-1 1-1 


._ -4 

j—2 J 


W^iflj — l)'^ 


5^ 5 ^ uIau„uIAUu 


21,22 = 1 23,24 = 1 


—Li + L 2 . 

Consider the hrst part Li. 


E i ^ _ ^ 

I 77,2 (h,-, — 1)2 h^jAflA — 1)^ 

v2<Ji<J2 *1,*2 = 1 *3,*4=1 

ll-l I 2 -I 


ll-l I 2 -I 


UlAU„UlAUi, 


4 E 


Tl ■ 

f2<ii<i2 11 


(hli - 1)2 h|(h,2 - 1)2 ^ 


54 54 C/fAC/.C/rAC/, 


ll-l j2-l 


^ ^ 77.2 (774, — 1)2 772 (77,2 — 1)2 

.2<ii<i2 111 -11 7 j2\ n > 

** 1 1 I 1 - 1 I 2-1 


EE'iAHi.'iEAt/i, 


+ C E 


Tl ■ 

2<ll<l2 11 

=Lii + Li 2 + F 13 . 




J^J^C/jAC/i.C/jAC/i, 
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Taking the same procedure as Lemma 5 and some tedious calculations, we can verify that 
Ln = o(a^), Li 2 + Li 3 = E^{r]n) and E{Ll) = o{a^). So, var(r 7 „) = E{ril) - E^{t]^) = o{a^). 
This completes the proof of ([3]). 

Next, we proof result (I3|). First of all, we note that 

n n 

i=2 j=2 

Accordingly, the assertion of this lemma is true if we can show 


E 


n 

E 

.i=2 


E[ZU^n,-l] 



Notice that 


E 


i=2 


n 


n 


J2E(Zij)=0{n-’<)J2E 

j=2 j=2 



Similar to lChen and QinI (120101 1. the last term can be decomposed as 3Q + P, where 


n j-1 

Q = 0{n-‘)Y,Y,EiUjU.U';U,UjUtUTUj), 

j=2 s^t 
n j-^ 

j=2 s=l 


Note that 


n j—1 

Q =0(n-*) E(Uj u.u^,u,u] u,uj U,)} 

j=2 s^t 
n j-1 

=0(n-’‘) E E(UJAU,UJAU,) 

j=2 s^t 

=0{n-^)(Tt 


where the last equality is followed by E{(UjAUjY) = 0(tr(A^)). Here we will show it. 

Vu =E{U{Y^i - Y2,)\Yu) = E{U{R^/\ei, - £2,))|R'/'si,) 
=E{U{R^/\eu-e2,))\e, 


f R^^‘^{£ii - £2j) 


-li) 


^V. 
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Because of ||(R^^^ — Ip)(£ii — ^ 2 j)\\'^ = 0(tr(R^/^ — Ip)^) = o{n then 

||R^'^^(^li ~ ^2j)ll =ll(^li ~ ^2j) + — Ip)(£^li — £ 2 j)\\ 

= ||£li — £2 j||( 1 + 0p(l))5 


and Vii = R^/^uij(l + Op(l)). Thus 

E{{UJAU,Y) = E((u?;Ri/2ARVVi)')(l + o(l)) = 0(tr2(A2)). 
Accordingly, we can verify that Q = o{a^). In addition, 


n j-l 

j=2 s=l 
ni j-l 


n ni 


n j-l 

=0(n-*).j5;5;i5(VjC/,)‘+ 5 ; 5;£;(VJ(7,)‘‘+ ^ E(UjU,: 

j=2 s=l ji=ni+l s=l j=ni + l s=ni + l 

=0(n-S)(Pi + P2 + R3). 


As the procedures for handling Pi, P 2 , P 3 are similar, let us only consider P 2 . By Lemma [H 
= 0(tr2(A2) +tr(A4)), and then 0 (n- 8 P 2 ) = o(cr^). Similarly, 0(n-8Pi) = 
o(cr^) and 0{n~^P^) = o(cr^). This completes t he pr oof of (jl]). Thus, according to the 
martingale central limit theorem flHall and Hvdel . llQSOll . we have 


^var(Z„) 


iV(0,l). 


Obviously, 

var(Z„) = -—L.-E([VlV„)^) + ^ + —E([VlVyf). 

ni[ni - 1) n2{n2 - 1) nin2 

So var(Z„) = cr^(l + o(l)). Then we complete the proof of this lemma. □ 


References 

Chen, S. X. and Qin, Y-L. (2010), A Two-Sample Test for High-Dimensional Data with 
Applications to Gene-Set Testing, Ann. Statist., 38, 808-835. 

Hall, P. G. and Hyde, G. G. (1980), Martingale Central Limit Theory and its Applications, 
New York: Academic Press. 

Zou, G., Peng, L, Feng, L. and Wang, Z. (2014), Multivariate-Sign-Based High-Dimensional 
Tests for Sphericity, Biometrika, 101, 229-236. 


13 










